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ABSTRACT 

The radial velocities of ~ 1800 nearby Sun-like stars are currently being mon- 
itored by eight high-sensitivity Doppler exoplanet surveys. Approximately 90 of 
these stars have been found to host exoplanets massive enough to be detectable. 
Thus at least ~ 5% of target stars possess planets. If we limit our analysis to 
target stars that have been monitored the longest (~ 15 years), ~ 11% possess 
planets. If we limit our analysis to stars monitored the longest and whose low 
surface activity allow the most precise velocity measurements, ~ 25% possess 
planets. By identifying trends of the exoplanet mass and period distributions 
in a sub-sample of exoplanets less-biased by selection effects, and linearly ex- 
trapolating these trends into regions of parameter space that have not yet been 
completely sampled, we find at least ~ 9% of Sun-like stars have planets in 
the mass and orbital period ranges Msin % > 0.3Mj upiter and P < 13 years, 
and at least ~ 22% have planets in the larger range Msin % > 0.1Mj up it er and 
P < 60 years. Even this larger area of the log mass - log period plane is less than 
20% of the area occupied by our planetary system, suggesting that this estimate 
is still a lower limit to the true fraction of Sun-like stars with planets, which may 
be as large as ~ 100%. 

Subject headings: Planetology, Planetary systems, Extrasolar Planets 



1. Introduction 

With increasingly sensitive instruments exoplanet hunters have detected more than 100 
exoplanets. The focus of these pioneering efforts has been to find and describe new exo- 
planets. As more exoplanets have been found, the question: 'What fraction of stars have 
planets?' has been looked at periodically. Estimates of the fraction of stars with planets 
can be simply calculated from the raw numbers of exoplanet hosts divided by the number of 
monitored stars. For example, Marcy & Butler (2000) report "5% harbor companions of 0.5 
to 8Mj upiter within 3 AU". Estimates can also be based on high precision Doppler targets 
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in a single survey. Fischer et al. (2003) report a fraction of 15% for high precision Doppler 
targets in the original Lick sample. Estimates can also be based on a semi-empirical analysis 
of exoplanet data (Tabachnik and Tremaine 2002). 

In this work we use a semi-empirical method, staying as close to the exoplanet data as 
possible. We use the growth and current levels of exoplanet detection to verify that sensitive 
and long duration surveys have been finding more planets in a predictable way. This is no 
surprise. However, quantitatively following the consistent increase of the lower limit to the 
fraction of stars with detected planets is an important new way to substantiate both current 
and future estimates of this fraction. Because of the growing importance of the question 
'What fraction of stars have planets?' and our increasing ability to answer this question, 
such a closer scrutiny of (i) the assumptions used to arrive at the answer, (ii) the parameter 
space in which they are valid and (iii) the associated error bars, is timely. 

If the Sun were among the target stars of the Doppler surveys, it would be another few 
years before the presence of Jupiter (the most easily detected feature of our planetary system) 
could be confidently detected. However, we are beginning to be able to answer the question: 
How typical is Jupiter? In Lineweaver & Grether (2002) we found that extrapolations of 
trends found in a less-biased sub-sample of exoplanet data indicate that Jupiter is a typical 
massive planet in the sense that it lies in the most densely occupied region of the log mass - 
log period plane. In Lineweaver et al. (2003) we updated our analysis to include exoplanets 
detected between January and August 2002 and found more support for this idea. However, 
in this previous work we made no effort to determine the absolute frequency of Jupiters or 
to answer the question: What fraction of stars have planets? That is the main focus of this 
paper. 

In Section 2 we present the exoplanet data set (Fig. 1). We analyze the target lists and 
detections and show how the fraction of stars with detected planets has increased over time 
(Figs. 2 & 3). In Section 3 we quantify how the fraction of stars with planets depends on 
the precision with which the radial velocity of individual stars can be measured (Fig. 4). 
In Section 4 we quantify trends in exoplanet mass and period by linear and power-law fits 
to histograms of a less-biased sub-sample of exoplanets (Fig. 5). Based on these trends, we 
extrapolate into larger regions of parameter space and give estimates for the fraction of stars 
with planets in these regions. In Section 5 we summarize, discuss and compare our results 
with previous work. Our approach is most similar and complimentary to the work of Fischer 
et al. (2003) and Tabachnik & Tremaine (2002). 
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Fig. 1. — Our Solar System compared to the 106 exoplanets detected by eight high-sensitivity 
Doppler surveys. The three regions labeled "Detected", "Being Detected" and "Not De- 
tected" indicate the selection effects in the log mass - log period plane due to a limited time 
of observation and limited radial velocity sensitivity (Section 2.1). The rectangular region 
circumscribed by the thin solid line, fits almost entirely into the "Detected" region, and 
contains what we call the less-biased sub-sample. This sub-sample is the basis of the trends 
in mass and period identified in Fig. 5. Metallicity of the host stars are indicated by the 
shade of gray of the point while eccentricity of the exoplanet orbits are indicated by point 
size (see key in lower left). Thin lines connect planets orbiting the same host star. The 
cross-hatched square is our Jupiter-like region defined by M Saturn < Msin % < 3M Jupiter and 
P <" P < P 

r asteroids _ r _ r Saturn- 
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2. Exoplanet Data 

2.1. Mass-period Distribution 

Figure 1 displays the masses and periods of the 106 exoplanets detected as of June 2003 
by the eight high precision Doppler surveys analyzed in this paper. The region in the upper 
left labeled "Detected" is our estimate of the region in which the Doppler technique has 
detected virtually all the exoplanets with periods less than three years orbiting target stars 
that have been monitored for at least three years. This region is bounded by three days 
on the left, three years on the right, 13Mj upiter on the top and at the bottom by a radial 
velocity of 40 m/s induced in a solar-mass host star. The largest observed exoplanet period 
and the smallest observed radial velocity induced by a detected exoplanet are used to define 
the boundary between the "Being Detected" and "Not Detected" regions. The discontinuity 
of the "Being Detected" region near Jupiter is due to the increased sensitivity of the original 
Lick survey at the end of 1994. No more exoplanets should be detected in the "Detected" 
region unless new stars are added to the target lists. Thus, all the exoplanets marked with 
a '+' (detected since August 2002) should fall in, or near, the "Being Detected" region. Of 
the 11 new detections since August 2002, 8 fall in the "Being Detected" region, 2 fall just 
inside the "Detected" region while one has P < 3 days - it was not being monitored with 
sufficient phase coverage for detection until recently. 

We define a less-biased sample of planet hosts as the 49 hosts to the planets within the 
rectangular region circumscribed by a thin solid line (3 days < P < 3 years and 0.84 < 
,^ sm 1 < 13) in Fig. 1. This rectangle is predominantly in the "Detected" region. We will 

Jupiter 

use this less-biased sample as the basis for our extrapolations (Sec. 4). As a completeness 
correction for the lower right corner of this rectangle being in the 'Being Detected' region, 
we add 4 planets as described in (Lineweaver et al. 2003) giving us a corrected less-biased 
sample of 53(= 49 + 4) planets. 



2.2. Increasing Fraction as a function of monitoring duration 

The exoplanets plotted in Fig. 1 are the combined detections of eight Doppler surveys 
currently monitoring the radial velocities of « 1812 nearby FGK stars (Tables 3 & 4). The 
top panel of Figure 2 shows how the number of these target stars has increased over the past 
16 years. 94 stars have been found to host 106 exoplanets. Of these 94, 92 fall within our 
selection criteria of Sun-like stars (= FGK class IV or V) with planets (Msin % < 13Mj up i ter ). 
Six known exoplanet hosts were not included in this analysis because they were found in the 
context of surveys whose search strategies and sensitivities cannot easily be combined with 
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results from the 8 sensitive Doppler surveys analyzed here. 

In Fig. 2, the date-of-first-monitoring and the discovery date for the detected exoplanets 
were largely obtained from the literature and press releases (Table 3). In some cases the date- 
of-first-monitoring was estimated from the first point on the planet host's velocity curve. The 
ramp-up time needed to start observing all the stars in a survey's target list was estimated 
from the distribution of the date-of-first-montitoring of a survey's detected exoplanets. That 
is, the ramp up time needed to start observing the detected hosts was used to estimate the 
ramp-up time needed to start observing all the stars on a survey's target list. The exceptions 
to this were the time dependence of the original Lick target list obtained from Cumming 
et al. (1999) and the time dependence of the Keck target list (FGK) estimated from the 
FGKM histogram of Cumming et al. (2003). 

The date-of-first-monitoring binning (middle-gray histogram in the top panel of Fig. 
2) shows that the fraction of target stars hosting planets has a maximum of ~ 11% from 
the longest monitored targets and then decreases as we average in target stars that have 
been monitored for shorter times. The two binning conventions used (date-of-detection and 
date-of-first-monitoring) are equivalent today at the current value of 5%(w 92/1812). 

Although this 11% estimate is based on the results from ~ 85 target stars of the two 
longest running surveys: Lick and McDonald, it is consistent with the increasing fraction 
based on the monitoring duration of all target stars. This is shown in Fig. 3. We fit a 
curve to this data normalized at a duration of 15 years to the weighted average fraction of 
the two longest duration bins (Sec. 4.3). The extrapolation of this curve (based on current 
sensitivity) to monitoring durations of 30 years (approximate period of Saturn) yields a 
fraction of ~ 15%. This extrapolation corresponds to extending the white "Detected" region 
in Fig. 1 to the upper right, above the diagonal K = 40 m/s line. 

In the lower panel of Fig. 3, we increase the fraction for a given group of stars, not 
when a planet is reported, but at the duration equal to the period of the newly detected 
planet. For example, recent detections of short period planets increase the fraction at a 
duration corresponding to the short period of the newly detected planet. This smooths over 
artificial delays associated, for example, with not analyzing the data for the first 8 years of 
observations, and enables us to trace with dotted lines the increasing fraction for each group 
of target stars. 
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Fig. 2. The fraction of stars with planets. TOP: Histograms of the increasing number of target stars and the increasing 

number of them found to be hosting at least one exoplanet. The host stars are binned in two ways. In the darkest histogram 
they are binned at the date of detection of the first exoplanet found orbiting the star. In the middle-gray histogram they are 
binned at the date of first monitoring of the star. Notice the small number of target stars until the discovery of the first planet 
(Mayor & Queloz 1995). BOTTOM: By taking the ratio, in the top panel, of the darker histograms to the lightest one, we 
obtain the fraction of target stars hosting at least one planet. The two binning conventions lead to different results. Using the 
date-of-dctection binning (darkest histogram) yields the intuitive result that the fraction of host stars starts at zero in 1995 
and climbs monotonically until its current value of 5 ± 1% (~ 92/1812). The datc-of-first-monitoring binning starts on the far 
left at 11 ± 3% from the bin of target stars that has been monitored the longest. The fraction decreases as we average in more 
stars that have been monitored for shorter durations. The dashed line is the result of a linear fit to a sparse, and therefore 
more independent, sample of the non-independent points (data points used have a larger point size). This line yields 9 ± 2% 
on the far left in good agreement with the single data point in the left-most bin. The last bin on the right is the same in both 
binning methods and includes all Sun-like target stars. Thus, as 5% is an average fraction from target stars that have been 
monitored for times varying between and 16 years, while Ri 11% is the fraction from target stars that have been monitored 
the longest (fa 15 yrs). The non-trivial task of estimating the total number of stars being monitored is described in Appendix 
A and summarized in Table 4. 
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Fig. 3. Number of targets as a function of how many years they have been monitored. Most targets have only been 

monitored for ~ 5 years. The longer a group of target stars has been monitored, the larger the fraction of planet-possessing 
targets. TOP: Notice the small number of target stars (~ 90) in the original Doppler surveys (bins on far right) and the ~ 2 
years it took to start monitoring all of them. These ~ 90 (~ 5% of the stars currently being monitored) are the only target stars 
that have been monitored long enough to begin to detect exoplanets with Jupitcr-like masses in Jupiter-like (~ 12 year) orbital 
periods. This sample in the two longest duration bins has resulted in 9 detections from 85 targets for a fraction of as 11%. It 
will be another five years before a substantially larger fraction of target stars have been monitored long enough to detect such 
planets. BOTTOM: The fractions plotted here are the ratios of the histograms in the upper panel as in the previous figure. 
This panel shows the effect of longer monitoring duration on the observed fraction of stars with planets. The fraction of target 
stars with detected planets increases with duration. This is distinct from the cumulative fractions shown in the lower panel of 
Fig. 2. However, the right-most bin here is the same as the left-most bin there. Calculation of the fraction of stars hosting 
exoplanets (the curve in the lower panel with the light gray 68% confidence region) is described in Sec. 4.3. The scatter of 
the data points around the smooth curve is due to small number statistics, varying instrumental sensitivities and variation in 
observational phase coverage. For example, the two points at durations of 7.5 and 8.5 years lie below the curve, due to the 
lower average sensitivity > 10 m/s of AFOE and Elodie surveys in these bins. The high point at 6.5 years is the start of the 
sensitive Keck survey. Thus, this plot also shows the effect of different instrumental sensitivities on the fraction of stars hosting 
planets. 
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Fig. 4. When stellar activity is low, high measurement precision is possible and a higher fraction of targets are found to 

host planets. TOP: The number of targets in the original Lick survey that have been monitored for a duration of 14-16 years 
and that have an excess radial velocity dispersion cr' v less than the value on the x axis. c' v is a measure of how precisely one can 
determine the radial velocity of the targets. The number of detections from these targets is also shown. The ratio (BOTTOM) 
gives the fraction of stars with planets as a function of a' v threshold. The a' v of the Lick target stars have been estimated from 
the stellar rotational periods tabulated by Cumming et al. (1999) using the relations <j' v = 10 * (12/Prot) 1 ' 1 for G and K stars 
and a' v = 10 * (10/P ro t) 1,3 for F stars (Saar et al. 1998). The curve is a power law (df/dcr' v oc u' v "') fit to a sparse sample 
of these non-independent points (larger point size). Thus, as we select for higher Doppler precision in the Lick sample, the 
fraction of targets possessing planets increases from 15% to 25% for the stars with the highest precision (cr' v < 2.5 m/s). Small 
number statistics increases the error bars as the decreasing <j' v threshold reduces the number of targets. In this cumulative plot 
the error bars are highly correlated. 
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3. High Doppler Precision Targets 

Instrumental sensitivity is an important limit on a Doppler survey's ability to detect 
planets. However, the level of stellar activity on a star's surface is also important. Using the 
Doppler technique, planets are more easily detected around slowly rotating stars with low 
level chromospheric activity, little granulation or convective inhomogeneities and few time- 
dependent surface features. To reduce these problems, some target lists have been selected 
for high Doppler precision (low stellar activity) by excluding targets with high values of 
projected rotational velocity v sin i or chromospheric emission ratio R' HK (Table 3). 

By selecting target stars monitored the longest, the detected fraction of Sun-like stars 
possessing planets increased from 5% to 11%. By selecting from the target stars monitored 
the longest, the stars with the highest Doppler precision, the fraction increases still further. 
Fischer et al. (2003) analyzed a group of high Doppler precision target stars in the original 
Lick sample and found that 15% possessed planets. We extend this idea in Fig. 4 by plotting 
the fraction of target stars with planets as a function of their stellar activity as measured 
by their excess radial velocity dispersion a' v (Saar et al. 1998). We find that this selection 
increases the fraction from 15% on the far right of Fig. 4 to ~ 25% on the far left for the 
highest Doppler precision stars. As long as high precision Doppler stars are an unbiased 
sample of Sun-like stars this indicates that at least ~ 25% of Sun-like stars possess planets. 

However, as we decrease the a' v threshold to consider only the highest precision stars, 
we are using fewer target stars to infer the fraction. Thus, the error bars on the estimates 
increase from 15 ± 5% on the right to 25 ± 15% on the left. 



4. Fitting for and extrapolating trends 

We use extrapolation to estimate the fraction of stars with planets within regions of 
mass-period parameter space larger than the less-biased sample (Fig. 1). Since there is 
no accepted theoretical model for the functional form for the mass or period distribution 
functions, we make simple linear fits to the histograms in log Msin i and log P and we fit 
conventional power laws to the histograms in Msin % and P (Fig. 5). 
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Fig. 5. Trends that we extrapolate to lower mass and longer periods. Histograms of planet mass (top) and period 

(bottom) for the less-biased sample of cxoplancts within the thin solid rectangle of Fig. 1, compared to all cxoplanct detections. 
Both log (left) and linear (right) versions are shown. Histograms in log Msin i and log P (panels A & B) are fitted with the 
linear functional forms dN / d(log M sini) = a logMsini + c and dN/d(logP) = b logP + d where the best fits to the histograms 
yield a = —30 ± 7, and b = 19 ± 4. The linearly binned histograms of Msin i and P (panels C & D) are fitted to the functional 
forms: dN/d(Msin i) oc (Msin i) a and dN/dP oc Pi 3 respectively and wc find a = -1.7 ±0.2 and (3 = -0.4 ±0.2. To check the 
robustness of the fits we change variables to log Msin i and log P (effectively producing a re-binning of the data) and fit for a 
and /3 in panels A & B. We find a = —1.9 ±0.2 and (3 = —0.2 ±0.2. Combining these two estimates gives our best estimates of 
a = —1.8 ± 0.3 and f} = — 0.3^q'4. In the fit of panel B, we ignore the 5 exoplanets in the smallest period bin because we are 
interested in the overall pattern that can be most plausibly extrapolated to larger P bins, not in the pile up associated with a 
poorly understood stopping mechanism at P < 12 days. For consistency, these 5 exoplanets have also been removed from the 
smallest period bin in panel D. 
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Fitting dN/d(logMsini) = a logMsini + c to the histogram of log Msin i (Fig. 5 A) 
we obtain the slope a = —30 ± 7. Fitting dN/ d(logP) = b logP + d to the histogram of log 
P (Fig. 5B) we obtain the slope b = 19 ± 4. These trends are shown as thick lines. The 
distribution of exoplanets is not flat in either log Msin i or log P. That is a = —30 ± 7 
is significantly different (~ 4a) from a = and 6 = 19 ± 4 is significantly different (~ 5a) 
from 6 = 0. Fitting dN/d(Msini) oc (Msini) a to the histogram of Msin i (Fig. 5C) we 
obtain a = -1.7 ± 0.2. Fitting dN/dP oc P? to the histogram of P (Fig. 5D) we obtain 
P = -0.4 ±0.2. 

Since d (In x) = dx/x, the functional form dN/d(Msini) oc (Msini) a can be writ- 
ten as dN/d(logMsin i) oc (Msin i)^ 1+ °^ and similarly dN/dP oc P 13 can be written as 
dN / d(logP) oc p( 1+l3 \ We fit these functions to the histograms of log Msin i and log P 
in Fig. 5A & B where we find a = -1.9 ± 0.2 and (3 = -0.2 ± 0.2. These values differ 
by ~ la from the values of a and (3 found in Fig. 5C & D. We attribute the differences 
to the different log and linear binning schemes. We combine the two values into our best 
estimates: a = —1.8 ± 0.3 and (3 = — 0.3^0^- These values are compared to other estimates 
in Table 1. If the distributions were flat in log Msin i and log P we would find respectively 
a ~ —1.0 and (3 ~ —1.0. However, both the trend in mass (a) and in period (f3) are signifi- 
cantly different (~ 2a) from flat. These slopes agree with our previous results (Table 1) and 
show that the evidence supporting the idea that Jupiter lies in a region of parameter space 
densely occupied by exoplanets, has gotten stronger in the sense that our new value b = 19 
is larger than our previous estimate and our new values for a and a are more negative than 
our previous estimates. 

The main differences between our results and previous results is that we obtain a steeper 
slope (more negative a) when we fit the funtional form dN/d(Msin i) oc (Msin i) a to the 
mass histogram of the data: we get a « —1.8 while other analyses get a « —1.1. Thus 
we predict more low mass planets relative to the more easily detected number of high mass 
planets, than do other analyses. We obtain a more shallow slope (less negative f3) when we fit 
the funtional form dN/dP oc P 13 to the period histogram of the data. We get (3 ~ —0.3 while 
other analyses get (3 ~ —0.8. Thus, we predict more hosts of long period planets relative 
to the more easily detected short period planets, than do other analyses. These differences 
are largely due to the differences in the way the incompleteness in the lowest mass bin and 
the longest period bin are accounted for. This can be seen, for example in the a = —0.7 
reported by Marcy et al. (2003) when no correction is made for incompleteness in the lowest 
mass bin. When we make no completeness correction and include the lowest mass bin, we 
reproduce their result. 
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4.1. Extrapolation Using Discrete Bins 

Based on slopes a and b, we make predictions for the fraction of Sun-like stars with 
planets within two regions of the log mass - log period plane. The estimated populations 
of the two lowest mass bins based on the slope a are indicated in Fig. 5A. Similarly the 
estimated populations of the two largest period bins based on slope b are shown in Fig. 5B. 
From the less-biased region (thin rectangle in Fig. 1) we extrapolate both in Msin i and P 
by 1 bin. When we extrapolate in Msin i or P we are doing so at a fixed range in P or Msin i 
respectively. However to estimate the fraction of stars within the thick rectangle in Fig. 1 
(given by the Msin i and P ranges 0.3 < Msin i/M Jupiter < 13 and 3 days < P < 13 years) 
we need to estimate the number of planets NMPext i n the bottom right-hand region (just 
below Jupiter, see Fig. 1). To do this we use Eq. 1 (see Fig. 6). We have iVj& = 53 ± 2 from 
Fig. 1 and Nuext = 41 ± 7 and Np ext = 39 ± 6 from Fig. 5A&B. Inserting these values into, 

^Mext ^ N M Pext 

N lb ~ N Pext ' 1 } 

(see Fig. 6) we obtain N M p e xt = 30 ± 9. Thus, N hosts = 163 ± 24 and the fraction /, of 
targets hosting planets is the ratio, 

J ~ TT » K Z ) 

1 ' targets 

where N targets ~ 1812 ±103 (Table 4). Finally we find / = (163±24)/1812± 103) « 9±1%. 
Thus, using the slopes a and b to extrapolate one bin into lower masses and longer periods 
(thick solid rectangle in Fig. 1) we find that 9 ± 1% of the targeted Sun-like stars have 
planets. 

Similarly and more speculatively we estimate the fraction contained within the thick 
dashed rectangle in Fig. 1 by extrapolating one more bin in mass (over the same range 
in period as the previous mass extrapolation) and by extrapolating one more bin in period 
(over the same range in mass as the previous period extrapolation). The analogous numbers 
are N lb = 53 ± 2, N Mext = (41 ± 7) + (53 ± 10), N Pext = (39 ± 6) + (52 ± 9) (where the 
sum of the 2 sets of numbers is the sum of the 2 separate bins). Equation 1 then yields 
Nj^Pext — 161 ± 50. Summing the four contributions as before yields Nh osts = 399 ± 84 
and thus / = (399 ± 84)/1812 ± 103) w 22 ± 5%. Thus, we estimate that 22 ± 5% of the 
monitored Sun-like stars have planets in the larger region (0.1 < Msin i/Mj upiter < 13 and 
3 days < P < 60 years) that encompasses both Jupiter and Saturn. This larger region is less 
than 20% of the area of the log mass - log period plane occupied by our planetary system. 
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Fig. 6. — The simple method of extrapolation used to predict planet numbers in under- 
sampled regions of the log mass - log period plane. The known value of No, (number of 
planet hosts in the less-biased region of Fig. 1) and extrapolations based on the slopes a 
and b were used to derive N Mext and N Pext . We then use Eq. 1 to estimate N MPext . Our 
estimate of the total number of planet hosts is then Nh os ts = AT i6 + NMext + Np ext + Nupext- 



Table 1: Best-fit trends in fits to mass and period histograms and comparison with other 
analyses 



Source 


a 




a 


b 


this paper, Fig. 4 A,B J ' 






-30 ±7 


19 ±4 


" " Fig. 4 C,D fc 


-1.7 ±0.2 


-0.4 ±0.2 






" " Fig. 4 A,B' 


-1.9 ±0.2 


-0.2 ±0.2 






" "combined Fig. 4 A,B,C,D 


-1.8 ±0.3 


-0-3183 






Marcy, Butler, Fischer & Vogt 2003 


_ Q7 m 








Lineweaver, Grether & Hidas 2003 


-1.6 ±0.2 






13 ±4 


Lineweaver & Grether 2002 


-1.5 ±0.2 




-24 ±4 


12 ±3 


Tabachnik & Tremaine 2002 


-1.11 ±0.10 


-0.73 ±0.06 






Stepinski & Black 2001 


-1.15 ±0.01 


-0.98 ±0.01 






Jorissen, Mayor & Udry 2001 


~ -1 








Zucker & Mazeh 2001 


1 









3 best-fit of dN/d(logMsin i) = a logMs'm i + c to histogram of log Msin i and best-fit of dN/d(logP) = b logP + d to 
histogram of log P 

k best-fit of dN/d(Msin i) oc (Msin i) a to histogram of Msin i and best-fit of dN/dP oc PI 3 to histogram of P 

1 best-fit of dN/d(logMsin i) oc (Msin i) 1 + c * to histogram of log Msin i and best-fit of dN '/ 'd(logP) oc P 1 +' 9 to histogram of 

log P 

m including the lowest mass bin with no completeness corrections 
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4.2. Extrapolation Using a Differential Method 

Instead of extrapolating one or two discrete bins, we can generalize to a more flexible 
differential method. For example, we can use the power law functional form to integrate a 
differential fraction within an arbitrary range of Msin % and P (Tabachnik & Tremaine 2002), 

df = c(Msin i) a P? d(Msin i)dP. (3) 

We find the normalization c by inserting the known values from the less-biased area: / = 
(53 ± 2/(1812 ± 103)) = 2.9 ± 0.2%, a = -1.8 ± 0.3 and (3 = -0.3^ (Table 1). We 
integrate between the boundaries of the less-biased rectangle (3 days < P < 3 years and 
0.84 < Mj^ZL < 13 ) and solve for the normalization. We find c = 8.6+5° 8 7 x 10~ 5 . Under the 
assumption that this same c, a and (5 hold over larger regions of parameter space, we can 
integrate over the larger region and solve for /. For the thick solid and thick dashed regions 
shown in Fig. 1, we find / = 201^% and 100ig 9 % respectively. Using this differential method 
with the slopes a and b we find values nearly identical with the discrete bin extrapolations 
based on a and b: f = 9 ± 1% and 22 ± 5% respectively. 

The power-law based estimates are consistent with the lower linear (a— and b— based) 
estimates in the sense that the large error bars on the power-law estimates overlap with the 
lower estimates of the linear method. The reason the power law fits yield larger fractions 
can be seen in Fig 5 A &B. In the low mass and large period regions, the dashed curves 
are higher than the solid lines. Without further data from less massive planets and larger 
periods we interpret this difference as an uncertainty associated with the inability of the data 
to prefer one of the two simple functional forms fit here. The power law fits are marginally 
better fits to the data. We interpret the lower values from the linear fits as conservative 
lower limits to the fraction /. 



4.3. Fractions in Velocity-Period Parameter Space 

The fractions discussed in Sections 2 & 3 are based on exoplanet detections constrained 
by the Doppler technique to a trapezoidal region of the log mass - log period plane defined 
by a minimal velocity K min . The fractions discussed in the previous two sections are based 
on rectangular regions of the log mass - log period plane. In the lower panel of Fig. 3, the 
fractions plotted as a function of duration come from a trapezoidal region (not a rectangular 
region) of the Msin % — P plane. Hence, a fit of dN/dP oc P 13 ' to the P histogram of detected 
planet hosts, will produce a value of f3' slightly different from the (3 of Fig. 5D. We find 
(3' = —0.5 ± 0.2 (while (3 = —0.4 ± 0.2). Such a difference is expected since the number of 
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planets at large P will decrease more steeply in the trapezoidal region, because as P increases 
the "Detected" regions becomes narrower than a region defined by a constant Msin %. The 
curve shown in the lower panel of Fig. 3 is the integral of P@' . It is normalized to the last 
two bins on the right. In plotting this integral we are assuming that a survey of duration P s 
has observed a large fraction of its target stars with Doppler-detectable exoplanets of period 

P<Ps- 

An interesting consistency check of the relationships between our extracted values of a, 
j3 and K min and the limits of integration for Eq. 3 is described in Appendix B. 

4.4. Jupiter-like Planets 

The fraction of Sun-like stars possessing Jupiter-like planets is important since Jupiter 
is the dominant orbiting body in our Solar System and had the most influence on how our 
planetary system formed. Exoplanets with Jupiter-like periods and Jupiter-like masses are 
on the edge of the detectable region of parameter space. The cross-hatched square in Fig. 
1 is our Jupiter-like region defined by orbital periods between the period of the asteroid 
belt and Saturn with masses in the range Msatum < Msin % < 3Mj upiter . We estimate the 
fraction of Sun-like stars hosting planets in this region using the differential method based 
on our best-fit values of a and b. Integrating between the limits of the cross-hatched area we 
obtain / = 5 ± 2% of Sun-like stars possess such a Jupiter-like planet. Using the differential 
method based on our best-fit values for a and (5 we obtain / = 28^21%- As with the different 
/ values resulting from these two methods in Sec. 4.2, these are consistent with each other 
and reflect the different functional forms used. 



Table 2: Fraction Comparison (see Fig. 7) 



Source 


Msin i range 


Period range 


Fraction 


Our Fraction" 


Our Fraction 6 


TT C 2002 
Liu et al. 2002 
Armitage et al. 2002 
TT C 2002 


1 - 10 M Jup 
0.25-15 M Jup 

0.3-10 M Jup 
0.003-10 M Jup 


2 days - 10 yrs 
0-8 yrs (< 4 AU) 
0.03-11.2 yrs d 
2 days - 10 yrs 


4% 
6% 
15% 
18% 


4±1% 
9 ±3% 
9 ±3% 
45 + Jjj% 


6 +3 2 % 
18 +1 7 9 % 
19±fg% 

100% 



a differential method based on slopes a and b 
b differential method based on powers a and /3 
c Tabachnik & Tremaine (2002) 
d corresponds to 0.1 - 5 AU 
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Semimojor Axis [AU] 




_ _ _ This Paper - 22% 




10° 10 1 10 2 10 3 10 4 10 5 

P Period [doys] 

Fig. 7. — We estimate that at least 9% of Sun-like stars have planets within the thick solid 
rectangle and at least 22% have planets within the thick dashed rectangle (Sec. 4.1). We 
find larger values using a differential method based on power- law fits (Sec. 4.2). Here, we 
compare our results with other published estimates of the fraction of stars with planets. 
The papers, estimates and the regions of the log Msin % - log P plane associated with these 
estimates are indicated. Our results are consistent with, but generally higher than, previous 
work (Table 2). 
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5. Summary and Discussion 

We have analyzed the results of eight Doppler surveys to help answer the question: 
'What fraction of stars have planets?'. We use the number of targets and the number of 
detected planet hosts to estimate the fraction of stars with planets. Quantitatively following 
the consistent increase of this fraction is an important new way to substantiate both current 
and future estimates of this fraction. We show how the naive fraction of ~ 5% increases to 
~ 11% when only long-duration targets are included. We extend the work of Fischer et al. 
(2003) by plotting the fraction as a function of excess dispersion and show how this ~ 11% 
increases to ~ 25% when only long-duration Lick targets with the lowest excess in radial 
velocity dispersion are considered. 

We have identified trends in the exoplanet data based on a less-biased sub-sample. 
We find stronger support than found previously for the idea that Jupiter -like planets are 
common in planetary systems (Table 1). We estimate the fraction of Sun-like stars hosting 
planets in a well-defined Jupiter-like region to be ~ 5%. 

We have extrapolated these trends into unsampled or undersampled regions of the log 
mass - log period plane. We find at least ~ 9% of target stars will be found to host an 
exoplanet within the thick rectangle of Fig. 1 and that more speculatively at least ~ 22% of 
target stars will be found to host an exoplanet within the larger thick dashed rectangle. Our 
results for the fraction of Sun-like stars with planets are consistent with but are, in general, 
larger than previous estimates (Table 2, Fig. 7). 

The largest uncertainty in this analysis is that we may be extrapolating trends derived 
from a small region of log mass - log period into regions of parameter space in which the 
trends are slightly or substantially different. This uncertainty is why we did not extrapolate 
beyond the dashed region in Fig. 1 that contains from our Solar System, only Jupiter and 
Saturn. 

It is sometimes implicitly assumed that most planetary systems will be like ours and 
that Earth-like planets will be common in the Universe. However, as we descend in scale 
from galaxy, to star, to planetary system to terrestrial planet we run more of a risk of 
self-selection. That is, the factors that are responsible for our origin may have selected a 
non-typical location. Thus, answering the question "What fraction of stars have planets?" 
must rely on the continued analysis of the statistical distributions of exoplanets detected by 
the increasingly precise and ground-breaking Doppler surveys. 

The hypothesis that ~ 100% of stars have planets is consistent with both the observed 
exoplanet data which probes only the high-mass, close-orb itting exoplanets and with the 
observed frequency of circumstellar disks in both single and binary stars. The observed 
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fractions / that we have derived from current exoplanet data are lower limits that are 
consistent with a true fraction of stars with planets f t , in the range 0.25 < f t < 1. If the 
fraction of Sun-like stars that possess planets is representative of all stars, our result means 
that out of the ~ 300 billion stars in our Galaxy there are between ~ 75 and ~ 300 billion 
planetary systems. 
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7. Appendix A: Estimating Target List Overlap 

To compute the total number of target stars being monitored (Fig. 2) we need to avoid 
double-counting. Target lists of six of the eight surveys considered here have been published, 
thus allowing the overlap in the FGK targets of these six to be eliminated by comparing 
target star names (Table 3). The total number monitored by these six surveys is then known 
(Nknown ~ 1124). The total number of targets monitored by the Coralie and Elodie surveys 
is known (N c = 1100, N E = 350) but without published target lists the extent of overlap 
with the other surveys can only be estimated. We do this by using the statistics of duplicate 
detections. Let the total number of exoplanet hosts discovered or confirmed by Elodie 
and Coralie be NEhosts = 16 and Nchosts = 33 respectively. The number of these planet 
hosts that were discovered or confirmed by any of the other 6 surveys are NEoverlap = 8 
and Ncoveriap — 10. Since there is no overlap between the Coralie and Elodie surveys 
(Udry private communication), these two sets of planet hosts are mutually exclusive. The 
fractional overlap of detections/confirmations is qe = NEoverlap/ NEhosts — 8/16 = 0.50 and 
gc = Nc overlap /Nc hosts = 10/33 = 0.30 (Tables 3 & 4). We use these fractional detection 
overlaps as estimates of the fractional target list overlaps. Thus, we estimate the total 
number of monitored targets (excluding overlap) as: 

Ntotai ~ N known + (1 - g c ) N c + (1 - g E ) N E . (4) 

However, estimated this way, N total includes target stars that were found to be single line 
spectroscopic binaries (SB1). Exoplanet detection is difficult in such systems so we correct 
for this by substracting the estimated fractions of SBl's in the various surveys (between 3% 
and 9%, see Tables 3 & 4). After this last step we find that N total « 1812 ± 103. The time 
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Table 3: Doppler Surveys" - Targets 





A AT 


Lick 


Keck 


Coralic 


Elodie 


AFOE 


CES 


McDonald 


Targets 5 


204 


360 


600 c 


1100 a 


350 


146 


37 


33 


FGK IV, V Targets 


198 


360 


443 


1100 


350 


136 


32 


33 


List Published 


Y 


Y' 


Y" 


N 


N 


Yf 


Y 


Y 


Year Started 


1998 


1987 


1996 


1998 


1994 


1995 


1992 


1987 


Spectral Types 


FGKM 


F7-K0 


F7-M5 


F8-M0 


FGK 


FGK 


F8-M5 


FGK 


Apparent V 


£ 7-5 


5 7 -5 


< 11 


<9 (for 80%) 


<7.65 


< 7.5 


< 6 


< 6 


L °g(«'HJf) 9 


< -4.5 


< -4.5 


< -4.5 












vsini (km/s)' 1 








<4 d 


< 5 


< 8 






Detected SBl's % 4 


8.8 


3.3 


3.3 


8.7 


3.5 


3.5 


8.1 


3.5 


Planet Host Detections^ 


18 


18 


26 


33 


16 


6 


2 


3 


References^ 


1 


2,9,10 


3,9,11 


4 


5,12 


6 


7 


8 



a AAT - Anglo Australian Observatory, Anglo Australian Telescope, UCLES Spectrograph 
Lick - Lick Observatory, Hamilton Spectrograph 
Keck - Keck Observatory, HIRES Spectrograph 

Coralie - European Southern Observatory, Euler Swiss Telescope, Coralic Spectrograph 
Elodie - Haute Provence Observatory, Elodie Spectrograph 
AFOE - Whipple Observatory, AFOE Spectrograph 

CES - European Southern Observatory, CAT Telescope, CES Spectrograph 
McDonald - McDonald Observatory, Coude Spectrograph 
b refers to all target stars in a survey 

c another 450 stars have one or more observations but have been dropped for various reasons (Butler et al. 2002) 

d there are 550 faster rotators in a lower priority target list (Udry 2003, private communication) 

e combined Lick/Keck (889 stars) 

f Korzcnnik 2003 private communication 

9 the fractional Call H and K flux corrected for the photosphcric flux (see Noyes 1984 and Saar et al. 1998) 
h projected rotational velocity 

' SB1: single-lined spectroscopic binaries. The AAT survey (Jones et al. 2002) finds 18 out of 204 target stars are SB1. 
(Nidever et al. 2002) mentions that 29 out of 889 Lick and Keck target stars arc SB1. (Endl et al. 2002) finds 3 out of 37. 
The original Lick survey has 5 out of 74 (Cumming et al. 1999). We estimate the fraction of SBl's for the other surveys by 
noting a survey as either northern or southern hemipshcre and taking an average of the known surveys in that hemisphere to 
estimate the fraction of SBl's. SB2's have been eliminated from the target lists. All surveys exclude binary stars when the 
angular separation is less than 2 . 

3 including confirmations but excluding confirmations of hosts that were known to have a planet prior to the start of 
observation. Thus several hosts of Lick and Elodie that are monitored by Keck and Coralie to increase phase coverage are only 
included in the Lick and Elodie numbers. 

k 1) Jones et al. 2002 2) Fischer et al. 2003 3) Butler et al. 2002 4) Udry et al. 2000. The six newest hosts discovered by 
Coralie and included here were announced during the XlXth IAP Colloquium "Extrasolar Planets: Today and Tomorrow", 
Paris, June 2003 and are in preparation: Mayor et al. 2003b, Zucker et al. 2003 and Udry et al. 2003. 5) Sivan et al. 2000 6) 
Korzcnnik 2003, private comm. 7) Endl et al. 2002 8) Cochran & Hatzes 1993 9) Nidcvcr et al. 2002 10) Fischer et al. 1999 
11) Vogt et al. 2000 12) Mayor et al. website 2003a 
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dependence of N tota i (Fig. 2, top panel) is taken from the time dependence of Nknown and 
from N c and N E (Table 3) using the approximation that go and g E are constants. Our 
estimates of the error associated with this procedure are indicated by the error bars on each 
bin in Figs. 2 & 3. 



8. Appendix B: Msin % — P to K — P parameter space 

We can use the fraction 11% at a duration of 15 years in the lower panel of Fig. 3 
as well as the known quantities P ma x, K m i n to perform an interesting consistency check of 
the relationships between them and the extracted values of a and f3. Assuming Mh ost R3 
M q » M and low eccentricity, the relation between Msin i and the semiamplitude K of 
radial velocity is 

, Msin i , . 

K « A ~pT/3~> (5) 

where A = 200 when K is expressed in units of m/s, Msin % in Mj and P in days. We are 
only interested in exoplanets with Msin % < 13Mj up i ter (~ the deuterium burning limit). 
Using Eq. 5 to change variables from Msin i to K we can write, 

df = c(Msin i) a P p d(Msin i) dP (6) 

(Eq. 3) as, 

d f = J^a) Kap ^ (1+a) dK dR (7) 

Under the assumption that Msin i is uncorrelated with P and that a and (5 are approximately 
constant within the region of interest, we can integrate between arbitrary limits. The upper 
limit on K depends on period: K max (P) = 13 A/ 'P 1 / 3 from Eq. 5. We then have, 



/ = 



d + a)A( 1+a ) 



[UAf +a ^ [ Pmax pHp-k^ [ Pmax pP+k^dP 

Jo Jo 



(8) 



Using our best estimates a = —1.8, (3 = —0.3, and c = 8.6 x 10~ 5 and / = 0.106 from 
the lower panel in Fig. 3 at 15 years, we integrate Eq. 8 and solve for K m i n . We find 
Kmin = I6I4 m /s- The weighted average internal error of the original Lick and McDonald 
observing programs that contain the 85 stars in these two bins is a = Q — lOm/s. The 
minimum signal to noise of exoplanet detections is ~ 3 so we expect K min R3 3 a which is 
indeed the case. 
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Table 4: Doppler Surveys - Cumulative Numbers of FGK IV-V Targets as a Function of 
Time 



Survcy(<7 m/s)° 


1988 6 


1989 


1990 


1991 


1992 


1993 


1994 


1995 


1996 


1997 


1998 


1999 


2000 


2001 


2002 


2003 


Lick 10 - 15 


44 


56 


62 


64 


66 


67 


73 




















Lick 3-5 
















74 


74 


74 


265 


360 


360 


360 


360 


360 


McDonald 10 - 20 


22 


33 


33 




























McDonald 5 - 10 








33 


33 


33 


33 


33 


33 


33 


33 












McDonald ~ 3 
























33 


33 


33 


33 


33 


CES 8-15 
















29 


29 


29 


29 


32 


32 


32 


32 


32 


32 


32 


Blodie ~ 10 





















142 


142 


159 


315 


350 


350 


350 


350 


350 


AFOE ~ 10 





















23 


90 


113 


136 


136 


136 


136 


136 


136 


Keck 2 - 5 
























8 


179 


270 


326 


360 


399 


443 


443 


A AT ~ 3 






























66 


198 


198 


198 


198 


198 


Coralic ~ 3 






























33 


700 


1000 


1100 


1100 


1100 


Cumulative 


66 


89 


95 


97 


99 


129 


135 


301 


376 


590 


1150 


2135 


2469 


2608 


2652 


2652 


no ovcrlap c 


63 


85 


87 


87 


89 


111 


111 


210 


263 


413 


790 


1465 


1688 


1782 


1812 


1812 


non-cumulative 


66 


23 


6 


2 


2 


30 


6 


166 


75 


214 


560 


985 


334 


139 


44 





no overlap c 


63 


22 


2 





2 


22 





99 


53 


150 


377 


675 


223 


94 


30 






a internal error also known as instrument sensitivity 

b binning is from July to June with January at the center of bin. 

c values have been corrected for overlapping target lists and for estimated numbers of single line spectroscopic binaries (SB1). 
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